CDS

Accession Number TCMCG078C24192
gbkey CDS
Protein Id KAG0492827.1
Location join(39942383..39942949,39944619..39944804,39945348..39945563,39945692..39945802,39952102..39952299,39952375..39952506,39954104..39954160,39954273..39954389,39955144..39955221,39960439..39960582,39961273..39962316)
Organism Vanilla planifolia
locus_tag HPP92_006225

Protein

Length 949aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA633886, BioSample:SAMN14973820
db_source JADCNL010000002.1
Definition hypothetical protein HPP92_006225 [Vanilla planifolia]
Locus_tag HPP92_006225

EGGNOG-MAPPER Annotation

COG_category U
Description AP-4 complex subunit epsilon
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko04131        [VIEW IN KEGG]
KEGG_ko ko:K12400        [VIEW IN KEGG]
EC -
KEGG_Pathway ko04142        [VIEW IN KEGG]
map04142        [VIEW IN KEGG]
GOs GO:0005575        [VIEW IN EMBL-EBI]
GO:0005622        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005737        [VIEW IN EMBL-EBI]
GO:0005829        [VIEW IN EMBL-EBI]
GO:0005911        [VIEW IN EMBL-EBI]
GO:0009506        [VIEW IN EMBL-EBI]
GO:0016020        [VIEW IN EMBL-EBI]
GO:0030054        [VIEW IN EMBL-EBI]
GO:0030117        [VIEW IN EMBL-EBI]
GO:0030119        [VIEW IN EMBL-EBI]
GO:0030124        [VIEW IN EMBL-EBI]
GO:0032991        [VIEW IN EMBL-EBI]
GO:0044424        [VIEW IN EMBL-EBI]
GO:0044425        [VIEW IN EMBL-EBI]
GO:0044444        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]
GO:0048475        [VIEW IN EMBL-EBI]
GO:0055044        [VIEW IN EMBL-EBI]
GO:0098796        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGGCTCCCAAGGCGGTTGGGGCCAGTCCAAGGAGTTCCTGGATCTGGTGAAGTCCATCGGCGAGGCCCGCTCCAAGGCGGAGGAGGACCGCATCGTTCTTCGCGAGATCGAGACTCTGAAGCGACGGATCGCGGAGCCAGACGTCACGCGACGCAAGATGAAGGAGTACATCGTACGTCTCGTCTATGTTGAGATGCTTGGCCATGATGCTTCCTTTGGGTATATTCATGCCGTGAAGATGACTCACGACGATAATGTTGTCCACAAACGTACTGGTTATCTTGCTGTGACGCTCTTCTTGAACGAGAATCACGATCTTATCATCCTCATTGTGAATACCATACAGAAGGATTTGAAGTCCGATAACTATTTGGTAGTTTCCGCTGCTCTGACGGCGGTGTGTAAGCTCATCAACGAGGAGACGATCCCAGCCGTGTTGCCACAGGTGGTGGAGCTCCTTGGGCATCCCAAGGAGGCTGTAAGGAAGAAGGCAGTCATGGCACTGCACCGGTTCTACCAGCGTTCACCAGCTTCAGTATCCCACCTCCTCTTACATTTCAGGAAGAGGCTTTGTGATGGTGATCCTGGAGTAATGGGTGCTGCACTATGTCCTATTTTTGATCTTATCACGGCTGATGTAAACTCATACAAGGATCTGGTTGTCAGTTTTGTGAGCATTCTTAAGCAAGTTGTTGAAAGAAGATTGCCCAAGTCATATGAATACCATCAAATGCCTGCTCCATTTCTTCAGGTTAAGTTACTTAAGATTCTTGCGTTGCTGGGTAGTGCGGATAAGCAAGCGAGTGGACACATGTACGCTGTACTGGGTGAGATATTTAGGAAGTGTGAAATGTCAAGCAACATTGGTAATGCTGTGCTCTATGAATGCATCTGCTGTGTCTCATCTATCCAGCCGAATACGAAGTTGCTAGATGCTGCTACTGAAGCAACTTCAAAATTTCTGAAGAGTGACAGTCATAATCTCAAATACATGGGAATTGATGCCCTTGGTCGACTGATTAAGATAAACCCTGATATTGCTGAGGATCATCAGCTGGCTGTTATTGATTGCTTGGAAGATTCTGATGATACTTTAAAGAGGAAGACCTTTGAGTTACTTTATAAAATGACAAAATCCACCAACGTTGAAGTCATAGTTGATCGGATGATTAATTATATGATTTCCATAAGCGATAAGCATTATAAAACTGAAATAGCATCACGTTGTGTTGAGCTTGCCGAACAATTTGCTCCAAGCAATCAATGGTTTATCCAGACTATGAATAAGATCTTTGAGCATGCTGGTGACGTAGTAAATGTCAAAGTGGCACACAATTTAATAAGGCTTATTGCTGAAGGATTTGGAGAAGATGACGACGGTGCAGATAGCCAATTAAGATCCTCAGCTGTTGACTCATATTTGCACATTCTTTCAGAACCAAAGCTTCCTTCCATTTTCTTGCAAGTCATATGCTGGGTGCTGGGAGAGTATGGTACCGCAGATGGGAAGTATTCTGCATCTTTTATTATTGGTAAAATTTGTGATGTTGCAGAGGCACATACAAATGACAGCACTGTTAAGGCTTATGCAATAACGAGTATCATGAAAGTTTGTGCATTTGAAATTGCTGCTGGAAGGAAGGTGGAAATGTTGCCCGAGTGTCAATCTTTAATCGACGAACTATTAGCTTCCCATTCAACTGATCTGCAGCAGCGTGCGTATGAGCTACAAGCTCTGTCGTGCTTGGATAGTCATGTTATTCAACATGTGATGCCCCCAGATGCTAGCTGCGAAGATGTTGAGGTTGATAAAACCTTGTCTTTCCTCGACGATTTTGTGCAAAAAGCGTTTGAGAAAGGCGCACAGCCCTACGTTCCTGAGAGCGAAAGGTCTGGTGTGTACGACATCAGCAGCTTTAGAAACCAATATCAACAAGAACAATCTGGGCATGGTCTCAGGTTTGAAGCTTATGAACTCCCCAAGTCCTTACCACCAACAAATATCCCCACAATTCTCCATCCCCTTCCATCCACCGATGTCGTCCCTGTTTTTGAACCATCACATTCACGACCAACCCATCAAACATCATCCGGTGTCGATGTTTCCTCGGACGTCGGAGTTAAGCTCAGGCTTGATGGTGTTCAGAGGAAGTGGGGCAGGCCAACCGAATTATCTTCTTCTTCAACCTCTGGCTCTGCAACTGAAAGTGCAGCAAATGGGTTCTCGCAATCTGATGGATGGAAGAATGCAGCTTCGCATCCCCGAGATTTCTCATCCGACAAGAGGAATCCACCACCGGTTGAAGTATCGACCGAGAAGCAGAGACTCGCTGCATCCCTGTTTGGTTCTTCAGCATCCAAATCAGAAAAGAAACCACCATCCACACGCAGCTCATCCAGGGCAAGTAATGCAAATTCTGTGAAGCCAACTTCTGCAACTCTTCCTACAGAACCTCCAAAGGAGAAAGGCTCTGCTTCCACACCTCCGCCTCCCGATTTACTTGACTTGGGCGAGTCATTTCCCTCGAGTCCTCCATCCGAAGACCCATTCAAGCAGCTTGAAGGGCTCATCGGACCAGACACTGCTCCCACCATCAATCCTCCTCTTACAGCAACCATTCCAAACACCGCAAACATCATCTCATCATACAGCGAAGCCACTTCGCCTGATCTCGGCACCGATTTCATTTCTCAATTCACCAAAACCTCTCACGGAGCTCATGGCATCAGCTCAGCAAAGAAGGGACCAAATTCTAGAGAAGCACTGGAGAAGGATGCCGTCGTAAGACACGTAGGTGTGACACCCACAGGTAATAATCCCAACCTGTTTAGAGATCTTCTGAGCTGA
Protein:  
MGSQGGWGQSKEFLDLVKSIGEARSKAEEDRIVLREIETLKRRIAEPDVTRRKMKEYIVRLVYVEMLGHDASFGYIHAVKMTHDDNVVHKRTGYLAVTLFLNENHDLIILIVNTIQKDLKSDNYLVVSAALTAVCKLINEETIPAVLPQVVELLGHPKEAVRKKAVMALHRFYQRSPASVSHLLLHFRKRLCDGDPGVMGAALCPIFDLITADVNSYKDLVVSFVSILKQVVERRLPKSYEYHQMPAPFLQVKLLKILALLGSADKQASGHMYAVLGEIFRKCEMSSNIGNAVLYECICCVSSIQPNTKLLDAATEATSKFLKSDSHNLKYMGIDALGRLIKINPDIAEDHQLAVIDCLEDSDDTLKRKTFELLYKMTKSTNVEVIVDRMINYMISISDKHYKTEIASRCVELAEQFAPSNQWFIQTMNKIFEHAGDVVNVKVAHNLIRLIAEGFGEDDDGADSQLRSSAVDSYLHILSEPKLPSIFLQVICWVLGEYGTADGKYSASFIIGKICDVAEAHTNDSTVKAYAITSIMKVCAFEIAAGRKVEMLPECQSLIDELLASHSTDLQQRAYELQALSCLDSHVIQHVMPPDASCEDVEVDKTLSFLDDFVQKAFEKGAQPYVPESERSGVYDISSFRNQYQQEQSGHGLRFEAYELPKSLPPTNIPTILHPLPSTDVVPVFEPSHSRPTHQTSSGVDVSSDVGVKLRLDGVQRKWGRPTELSSSSTSGSATESAANGFSQSDGWKNAASHPRDFSSDKRNPPPVEVSTEKQRLAASLFGSSASKSEKKPPSTRSSSRASNANSVKPTSATLPTEPPKEKGSASTPPPPDLLDLGESFPSSPPSEDPFKQLEGLIGPDTAPTINPPLTATIPNTANIISSYSEATSPDLGTDFISQFTKTSHGAHGISSAKKGPNSREALEKDAVVRHVGVTPTGNNPNLFRDLLS